Fast Detection of Overlapping Communities via Online Tensor Methods

نویسندگان

  • Furong Huang
  • Mohammad Umar Hakeem
  • Animashree Anandkumar
چکیده

We present a fast tensor-based approach for detecting hidden overlapping communities under the mixed membership stochastic block (MMSB) model. We present two implementations, viz., a GPU-based implementation which exploits the parallelism of SIMD architectures and a CPU-based implementation for larger datasets, where the GPU memory does not suffice. Our GPU-based implementation involves a careful optimization of storage and communication, while in our CPU-based implementation, we perform sparse linear algebraic operations to exploit the data sparsity. We use stochastic gradient descent for multilinear spectral optimization and this allows for flexibility in the tradeoff between node sub-sampling and accuracy of the results. We validate our results on datasets from Facebook, Yelp and DBLP, where ground truth is available, using notions of p-values and false discovery rates, and obtain high accuracy for membership recovery. We compare our results, both in terms of execution time and accuracy, to the state-of-the-art algorithms such as the variational method, and report many orders of magnitude gain in the execution time. For instance, for DBLP dataset with about a million nodes and 16 million edges, the execution time is about two minutes. 1 Summary of Contributions Studying community formation is an important problem in social networks. A community generally refers to a group of individuals with shared interests or beliefs (e.g. music, sports, religion), or relationships (e.g. friends, co-workers). In a social network, we can typically observe and measure the interactions among the actors, but not the communities they belong to. A challenging problem is then to estimate the communities of the actors using only their observed interactions. In general, actors can participate in multiple communities, and detecting overlapping communities is even more challenging. Our goal is to design algorithms which can accurately detect overlapping communities, and yet be easily parallelizable for fast and scalable implementation on large graphs with millions of nodes. Moreover, we learn a probabilistic community model which allows us to carry out prediction tasks such as link classification. In this work, we present a fast approach for detecting overlapping communities under the mixed membership stochastic block model (MMSB) [1]. It is based on estimating tensors from subgraph counts such as 3-stars in the observed network, and then performing linear algebraic operations (e.g. SVD), and an iterative stochastic gradient descent method for tensor decomposition using implicit trilinear operations. We present two implementations, viz., a GPU-

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Fast Detection of Overlapping Communities via Online Tensor Methods on GPUs

We present a scalable tensor-based approach for detecting hidden overlapping communities under the mixed membership stochastic block model. We employ stochastic gradient descent for performing tensor decompositions, which provides flexibility to tradeoff node sub-sampling with accuracy. Our GPU implementation of the tensor-based approach is extremely fast and scalable, and involves a careful op...

متن کامل

Overlapping Community Detection in Social Networks Based on Stochastic Simulation

Community detection is a task of fundamental importance in social network analysis. Community structures enable us to discover the hidden interactions among the network entities and summarize the network information that can be applied in many applied domains such as bioinformatics, finance, e-commerce and forensic science. There exist a variety of methods for community detection based on diffe...

متن کامل

Detecting Overlapping Communities in Social Networks using Deep Learning

In network analysis, a community is typically considered of as a group of nodes with a great density of edges among themselves and a low density of edges relative to other network parts. Detecting a community structure is important in any network analysis task, especially for revealing patterns between specified nodes. There is a variety of approaches presented in the literature for overlapping...

متن کامل

Identification of Overlapping Communities via Constrained Egonet Tensor Decomposition

Detection of overlapping communities in real-world networks is a generally challenging task. Upon recognizing that a network is in fact the union of its egonets, a novel network representation using multi-way data structures is advocated in this contribution. The introduced sparse tensor-based representation exhibits richer structure compared to its matrix counterpart, and thus enables a more r...

متن کامل

Utilizes the Community Detection for Increase Trust using Multiplex Networks

Today, e-commerce has occupied a large volume of economic exchanges. It is known as one of the most effective business practices. Predicted trust which means trusting an anonymous user is important in online communities. In this paper, the trust was predicted by combining two methods of multiplex network and community detection. In modeling the network in terms of a multiplex network, the relat...

متن کامل

A tensor approach to learning mixed membership community models

Community detection is the task of detecting hidden communities from observed interactions. Guaranteed community detection has so far been mostly limited to models with nonoverlapping communities such as the stochastic block model. In this paper, we remove this restriction, and provide guaranteed community detection for a family of probabilistic network models with overlapping communities, term...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013